Towards Intelligent Information Retrieval: An Overview of IR Research at U. Mass

نویسنده

  • W. Bruce Croft
چکیده

The Smart information retrieval project was started in 1961. During the past 30 years methods have been developed for the automatic term assignment to natural-language texts (automatic indexing), automatic document clustering, collection searching, and the automatic reformulation of search queries using relevance feedback. Many of these procedures have been incorporated into prac tical retrieval settings. Although there is no hope of solving the content analysis problem for naturallanguage texts completely satisfactorily, the possibility of automatically analyz ing very large text samples offers new approaches for automatic text processing and information retrieval. Some methods for the massive analysis of natural language text are outlined together with applications in information retrieval. 1 The Vector Processing System Conventional information retrieval systems are based on Boolean query formu lations where keywords are used together with connecting Boolean operators. By constructing large so-called inverted indexes that contain for each allowable keyword the lists of addresses of all documents indexed by that keyword, it is possible to determine the set of documents corresponding to a given Boolean query formulation from the information stored directly in the index. This im plies that rapid responses can be provided in a conventional retrieval setting using standard Boolean processing methods. The conventional Boolean search system does, however suffer from a number of serious disadvantages: First, the Boolean logic remains inaccessible to many Department of Computer Science, Cornell University, Ithaca, NY 14853-7501. This study was supported in part by the National Science Foundation under grant 1ST 84-02735.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

XML Information Retrieval: An overview

—Locating and distilling the valuable relevant information continued to be the major challenges of Information Retrieval (IR) Systems owing to the explosive growth of online web information. These challenges can be considered the XML Information Retrieval challenges as XML has become a de facto standard over the Web. The research on XML IR starts with the classical IR strategies customized to X...

متن کامل

E.Coli Search: Self Replicating Agents for Web Based Information Retrieval

Although search engines are often used for information retrieval (IR) from the World Wide Web (WWW), current search engine technology seems obsolete. The quality of query results from today’s search engines is unacceptable, creating a demand for new information search and retrieval techniques. The conventional IR methods often lack the flexibility to adapt to changes in the content of the WWW. ...

متن کامل

Knowledge Reduction Information Retrieval Model in Pathology Medical Domain

We present an efficient intelligent information retrieval model using reduction of domain-specific expert knowledge, demonstrating its use in the pathology medical domain. We created an information retrieval model that incorporates domain-specific knowledge to provide knowledgeable answers to users. This model converts domain-specific knowledge to a relationship of terms represented as quantita...

متن کامل

Learning in Intelligent Information Retrieval

Information retrieval (IR) systems are used for nding, within a large text database, those documents containing information needed by a user. The complex and poorly understood semantics of documents and user queries has made feedback and adaptation important characteristics of IR systems. In this paper we brieey survey previous research on machine learning in IR systems and discuss promising ar...

متن کامل

Image Information Retrieval: An Overview of Current Research

This paper provides an overview of current research in image information retrieval and provides an outline of areas for future research. The approach is broad and interdisciplinary and focuses on three aspects of image research (IR): text-based retrieval, content-based retrieval, and user interactions with image information retrieval systems. The review concludes with a call for image retrieval...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Data Eng. Bull.

دوره 13  شماره 

صفحات  -

تاریخ انتشار 1990